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ABSTRACT 

The Gaia satellite was selected as a cornerstone mission of the European Space Agency 
(ESA) in October 2000 and confirmed in 2002 with a current target launch date of 
2011. The Gaia mission will gather on the same observational principles as hippar- 
COS detailed astrometric, photometric and spectroscopic properties of about 1 billion 
sources brighter than mag V = 20. The nature of the measured objects ranges from 
NEOs to gamma ray burst afterglows and encompasses virtually any kind of stars 
in our Galaxy. Gaia will provide multi-colour (in about 20 passbands extending over 
the visible range) photometry with typically 250 observations distributed over 40 well 
separated epochs during the 5-year mission. The multi-epoch nature of the project will 
permit to detect and analyse variable sources whose number is currently estimated in 
the range of several tens of million, among the detectable sources. 

In this paper, we assess the performances of Gaia in analysing photometric pe- 
riodic phenomena. We first present quickly the overall observational principle before 
discussing the implication of the scanning law in the time sampling. Then from ex- 
tensive simulations one assesses the performances in the recovery of periodic signals 
as a function of the period, signal-to-noisc ratio and position on the sky for simple 
sinusoidal variability. 

Key words: Space vehicles - survey - techniques : miscellaneous - methods : nu- 
merical - variable stars. 



1 INTRODUCTION: THE Gaia MISSION 

The very successful ESA mission Hipparcos was the first 
modern multi-epoch all sky space survey. Although primar- 
ily designed for astrometry the full light (supplemented by 
two chromatic bands of lesser accuracy) photometry proved 
to be a significant addition to the initial goal. About 10% 
of the objects in the final catalogue have information about 
variability, 7% are listed in the periodic and unsolved cat- 
alogues, and 3% (i.e 3794) have been published with their 
light curves. A refined analysis of the occurrence of vari- 
ability through the whole HR diagram was subsequently 
achieved (Eyer & Grenon, 1997). 

The Gaia mission, a cornerstone of the ESA science pro- 
gramme, will dramatically improve this analysis in several 
respects: (i) the photometric precision will be much better, 
in the range of milli-magnitude for single observations (com- 
pared to 0.01 for hipparcos); (u) the number of objects wiU 
be increased by a factor nearly 10 000; (iii) multiband pho- 
tometry (15 to 20 photometric channels in addition to the 
full light) will be available and will permit to identify the 



physical processes responsible for the variable phenomena. 
In addition, there will be an instrument for low resolution 
spectroscopy. The mission selected in 2000 and confirmed 
in 2002 with a modified design is scheduled for a launch in 
mid-2011 for a 5-years sky survey. 

Scientific preparation for the mission involves the par- 
ticipation of some 15 working groups sharing responsibility 
for the simulation, the data processing, the science modelling 
and the instrument optimisation. Extensive simulation pro- 
grams have been developed feeding all the groups with input 
data required to test the processing. In particular the scan- 
ning parameters are now fixed and constitute the basic tool 
to determine the times series. The algorithm used in this 
paper is based on the analytical algorithm built and imple- 
mented by one of us (Mignard, 2001) and used thoroughly 
by the Gaia community. 

Regarding the photometry, as of mid-2005, the choice of 
passbands, and the parameters of the scanning law are now 
fixed. The relative positions of the fields of view may again 
change before the design is frozen by the prime contractor 
next year. We have used the design of the study phase of 
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mid-2003. The measurement errors have been also assessed 
and for the full light magnitude (the so-called G magnitude) 
can be found in (Jordi, 2003). It is better than 0.002 mag for 
a single observation (one transit in the astrometric field) of 
stars brighter than G = 15 and of the order of 0.01 mag for 
the faintest stars (G ~ 20) detectable with Gaia. This gives 
already a rough idea of the wealth of science information 
recoverable on the variable stars, virtually over any spectral 
type or luminosity class. 

The large scale photometric survey will have significant 
value for stellar astrophysics, supplying basic stellar param- 
eters (effective temperatures, metallicity and abundances, 
gravities) together with a virtually unbiased sample of vari- 
able stars of nearly all types, such as eclipsing binaries, pul- 
sating stars with periods ranging from hours to several hun- 
dred days. Much shorter periods could be achieved also from 
the exploitation of the light recorded during the passage of 
the image on individual CCDs of a field of view. This par- 
ticular and very specific sampling is not addressed in this 
paper. Variability search methods will be systematically ap- 
plied to every photometric time series, before the period 
search. Estimated number of variable stars to be detected as 
such is highly uncertain, but crude estimates suggest some 
20 million in total, half of them being periodic variables 
(Eyer & Cuypers, 2000). 



2 THE OBSERVING INSTRUMENT 

Although most technical details can be found on the Gaia 
documentation (e.g Ferryman et al. 2001), there are two 
items particularly relevant to the study of the variable stars, 
as they impact directly on the time sampling. The first is the 
way the sky is scanned, allowing to observe a given region 
of the sky at about 30 to 50 different times during the 5- 
year mission, with an average return in the same direction 
every 6 weeks. The second is concerned with the repeated 
measurements over much smaller time intervals, typically 
between one to ten hours. This sampling is determined by 
the relative positions of the different fields of view, through 
which photometric measurements are carried out. These two 
aspects of the design are discussed in the following. 

The instrumental parameters used in this paper refer 
to the current baseline of the design at the time of writing. 
This is subject to possible change during the study phase, 
without major impact on the main results of this paper, 
regarding the determination of the period of the variables, 
all the more because the simulation is primarily based on 
the data coming from the astrometric fields of view. 

2.1 The scanning law 

Measurements with Gaia are conducted with a scanning 
satellite with two widely separated astrometric fields of view, 
similar in its principle to the solution adopted for Hippar- 
cos. This proved to be a very efficient and reliable solution 
for global astrometry. 

The sky is scanned continuously following a pre-defined 
pattern, with the satellite spin-axis kept at a fixed angle of 



PFOV 




Figure 1. The nominal scanning law of Gaia. The spacecraft 
rotates around the z— axis in 6 hours so that the FOVs sweep 
approximately a circle on the sky. The z— axis is constrained to 
move on a sun-centered cone of 50 degrees half-aperture with a 
period of 70 days, forcing the plane of scan to sway back and 
forth with inclinations between 40 to 140 deg. The axis of the 
cone follows the yearly solar motion. 

50° from the Sun (Fig. 1) and nominally perpendicular to 
each viewing direction. A sun-centered precession motion of 
the spin axis in ~ 70 days allows a rather uniform coverage 
of the full sky every six months, yielding repeated observa- 
tions of every region of the sky. The scan period is six hours 
amounting to a displacement rate of the stellar images on 
the CCDS of 60 arcsec/s. 

2.2 The photometric fields of view 

The Gaia payload is fitted with two main telescopes with an 
aperture of 1.4 x 0.5 m^, primarily dedicated to astrometry 
and full light photometry. On each field, several columns 
of CCDS are covered with broad-band filters. This broad- 
band photometric field provides multi-colour, multi-epoch 
photometric measurements for each object observed in the 
astrometric field, for chromatic correction and astrophysical 
purposes. Four or five photometric bands should be imple- 
mented within each instrument with an integration time of 
~ 3.3 s per CCD. A third telescope with a square entrance 
pupil of 0.5 X 0.5 ve? collects starlight for the spectrometer 
and the medium-band photometer. 



3 SEQUENCES OF OBSERVATION 

Due to the combination of the quick spin motion over six 
hours and that of precession over 70 days coupled with the 
annual solar motion, the sampling of the epochs at which 
a particular object is measured is very peculiar and quite 
irregular. However, irregular does not mean random, and 
there are some basic recurring patterns which apply to each 
sequence. One must consider separately the succession of 



© 2004 RAS, MNRAS 000, 1-?? 



Period recovery success rate expected with Gaia 3 



Table 1. Fundamental sampling intervals following from the Gaia 
scanning law and from the location of the viewing directions. 

Transitions from an initial field to a final field arc considered with 
the interval given in days. PFOV; preceding astrometric field of 
view, FFOV: following astrometric field of view, MBP: Medium 
band photometer field of view. 



Initial field 


Final field 


Interval 
hours 


PFOV 


FFOV 


1.78 


PFOV 


MBP 


5.38 


PFOV 


PFOV 


6.00 


FVOV 


MBP 


3.60 


FFOV 


PFOV 


4.22 


FFOV 


FFOV 


6.00 


MBP 


PFOV 


0.62 


MBP 


FFOV 


2.40 


MBP 


MBP 


6.00 



observations over period less than ~ 1-2 day due to the re- 
peated passages over few consecutive rotations of the satel- 
lite and arc primarily governed by the spin and the focal 
plane layout, and the return in a particular direction after 
several weeks without observations, controlled by the preces- 
sion motion. So a complete sequence divides itself naturally 
in succession of epochs, widely separated in time, in which 
there are a small number (between ~ 1 to 10) of observa- 
tions in the cistrometric or photometric fields of view. For a 
regular sampling the ability to carry out accurate frequency 
analyses of variable stars depends on the succession of re- 
peated observations over short time-scale, that is to say on 
sequences of successive revolutions separated by returns of 
the viewing direction to the same area of the sky. However 
for a purely random sampling one can obtain alias-free re- 
sults oven for frequencies much higher than the inverse of 
the smallest intervals between observations and one should 
not worry too much about the time gaps between succes- 
sive observations. But Gaia sampling is obviously not per- 
fectly regular but also falls short from a purely random sam- 
pling. This is something intormodiato with regular patches 
over short time-scales repeating with much irregularity over 
longer time-scales. At the end the combination of the two 
interleaved sampling proved important to achieve an am- 
biguous and accurate frequency retrieval. 

The variety of short term intervals likely to occur within 
a sequence of observations is given in Table 1. The high- 
est temporal resolution is obtained by combining successive 
crossings in the MBP field with a photometric measurement 
obtained in the astrometric preceding field, leading to a min- 
imum interval of 38 minutes. 



3.1 Short term succession 

3.1.1 Astrometric fields 

Let a star be observed in the preceding astrometric field, at 
some reference time t — 0. The star is located somewhere in 
the field of view of width 0.66 degree in the cross-scan direc- 
tion. We use a local rectangular frame with x-axis along the 
scan direction and y-axis in the transverse direction. Take 
the star image at the middle of the field for this reference 
time, that is to say j/ = 0. Six hours later this viewing direc- 
tion is back in nearly the same region of the sky. However 
the pole of the scan circle has moved by nearly 1 degree on 
the sky, and the ordinate of the star in the field can now be 
as large as 1 degree. In this case the star is no longer in the 
field of view and cannot be re-observed. On the other hand 
if the star happens to lie very close to one of the two sta- 
tionary points of the scan circle (over a period of few days 
the scan of the sky can be described by the slow motion of a 
great circle, implying that there arc two nodes on this circle 
which are stationary) the star has virtually not moved on 
the field and will be observed again after one revolution, and 
this situation could recur during four or five rotations of the 
satellite, supplying as many observations in the preceding 
FOV. 

Consider now the transition between the preceding and 

following astrometric FOv's. They are separated by 106 de- 
grees, meaning that it takes 106 mirmtes to rotate from one 
field to the next. During this interval the apparent trans- 
verse motion of the stax with respect to the scan circle is 
at most of 0.3 degrees. Therefore a star observed at a ran- 
dom position in the preceding field will be observed in the 
following field with a ~ 0.5 probability. The average trans- 
verse motion being only 2/7r x 0.3 = 0.19 degree, this makes 
the odds more favourable for the observations to be grouped 
at least by pair. With smaller transverse motion (star closer 
to the instantaneous node) one may have four or five suc- 
cessive pairs of observations, each separated by 0.25 day. 
During these crossings there are photometric measurements 
in full light and in the five broad band filters as well. 

We can do a simple probabilistic modelling to assess 
the probability that an observation in a particular field is 
isolated (sequence of length 1) or followed by at least one 
observation in the next available field of view. It is reason- 
able to assume that the first observation of a sequence may 
occur at any ordinate in the field (this is a crude approxima- 
tion, as the first observation of a sequence is more likely to 
occur near the upper or lower edge of the field) . The chance 
of having the star observable in the next field depends on the 
transverse velocity and on the interval of time between the 
two fields. Let Vt be the transverse velocity and At the inter- 
val of time. One has At = 106 minutes from preceding FOV 
(PFOV) to following FOV (ffov), 254 mn from FFOV to PFOV 
and 360 mn = 0.25 day for one revolution. The transverse 
velocity depends very simply on the longitude (p of the opti- 
cal axis on the instantaneous scan circle and takes the form 
Vt = Vmax am{<p + a) with Vmax = 0.170 deg/h and where 
<f> is & fast moving angle (period of 6 h) and the phase a is 
a slow varying variable with a typical time-scale of 70 days. 
We call X the random variable X = 1 if the star is not re- 
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observed after the first crossing at the start of a new epoch, 
and X = for the opposite outcome. If h is the transverse 
size of the field, it is easy to establish (see the Appendix) by 
conditioning the random variable to the transverse velocity 
and computing the expectation E(E(X|Vt)) that. 
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if 
if 



u > 1 



u < 1 



(1) 



where u — h/Vm&x At. When u > 1 the transverse mo- 
tion over At is always less than the width of the FOV while 
it can be larger otherwise when u < 1. For the transitions 
between the preceding to following FOV this yields a proba- 
bility P = 0.71 that more than one observation occurs and 
0.32 for the transition between the following and preceding 
field, with the same assumption that the observation in the 
first field is the first of a sequence. 

Table 2 gives the actual frequency of the different length 
of the sequence based on a simulation over 10 000 randomly 
distributed stars on the sky. The positions of these stars have 
been compared with the viewing directions over five years, 
thus producing for each star the time sampling in each field 
of view. The termination criterion for a continuous sequence 
of observations is activated when the interval between two 
successive observations is larger than few revolution periods 
of the satellite. A gap of 3 or 4 revolutions (~ 1 day) may 
happen in a short time sequence in the passage from MBP 
transits to BBP transits as a result of the relative location of 
their respective detectors as shown in Fig. 2. From trials and 
errors we have found that a time gap between two successive 
observations larger than 2.5 days was a good signature to 
isolate short observing sequences from very different observ- 
ing epochs. This means essentially that when one has a gap 
of 2.5 days without observation, the actual gap will be in 
practice much larger (~ 20 — 30 days), with few exceptions 
well accounted for by the idiosyncrasies of the scanning law. 

In the first two columns of Table 2, only the obser- 
vations in the astrometric fields have been monitored and 
sequences starting with the preceding FOV (sequences like 
P, PF, PFP, PFPF • ■ ■) or following FOV (sequences like F, 
FP, FPF, FPFP, ■ ■ ■) are counted separately. They are not 
symmetrical, because the occurrence of a sequence pfov — * 
FFOV is more likely than the opposite sequence. The typical 
sequence comprises two observations (PF) in the first group, 
while sequences of unit length (F) are the most common in 
the second group. The average length of the sequences is 
2.1 in the first case and 1.6 in the second. While the num- 
ber of long sequences is small (less than one percent of the 
sequences longer than 10 observations), the length of the se- 
quences can be very large and can reach 20 or 40 consecutive 
observations, meaning that the star is observed at regular 
interval for up to 5 days. 



3.1.2 Astrometric and spectroscopic fields 

We introduce now the field with the medium band filters 
(MBP fields of view). It is located ahead of the preceding 
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Figure 2. Relative size of the fields of view of Gaia. Many more 
field crossings arise in the medium band photometer (mbp) than 
in the broad band photometers (bbp) in the astrometric fields, as 
a result of the vertical extension. 
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Spectro/MBP 



Astro 1 (PFOV) 



Scan direction 



Figure 3. Relative positions of the viewing directions along the 
scan circle. MBP = Medium band photometry, PFOV (resp. 
FFOV) = preceding (resp. following) field of view. 



field, at 38 degrees as depicted in Fig. 3 and comprises two 
subfields symmetrically placed with respect to the scan di- 
rection as shown in Fig. 2. Its vertical extension is much 
larger than the astrometric fields, meaning that the prob- 
ability that a star is observed in this photometric field is 
higher than in the astrometric FOVs. The average number 
of observations is just above 200 in the medium band pho- 
tometer over the 5-year mission. The distribution with the 
ecliptic latitude (Fig. 4) displays the same symmetrical be- 
haviour as for the observations with the broad band filters. 

The transverse motion over one revolution is always 
smaller than the vertical extension of any of the two sub- 
fields, which implies that observations take place over at 
least two successive revolutions of the satellite. In fact, the 
average length of a sequence consists of 6-7 consecutive ob- 
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Figure 4. Number of measurements in MBP (top) and BBP 
(bottom) fields of view as function of ecliptic latitude b. The mean 
number of observations is 200 for the MBP and 80 for the BBP. 

servations within less than a day, with measurements in ev- 
ery bandpass. In this case all the sequences but a handful 
start with an observation in the MBP field and a typical 
sequence is MMPFMM, meaning two MBP measurements 
followed by observations in the preceding and following as- 
trometric fields and then again two in the MBP field. The 
probability distribution of the sequence lengths is given in 
the rightmost column of Table 2. Very long sequences of 
nearly 100 uninterrupted consecutive observations over ~ 
10 days may happen exceptionally. 

The distribution of the short intervals when observa- 
tions in the three photometric fields are combined is shown 
in Fig. 5. The most frequent interval is 0.25 days, corre- 
sponding to one spin period between two successive obser- 
vations in the medium band photometer and accounts for 
50 percent of the intervals. The origin of the other intervals 
can be read in Table 1, by adding in some instances one or 
two revolutions of 0.25 days. 

3.2 Long term succession 

The short term succession within an epoch is essentially 
driven by the relative position of the fields and the spin 
period, which control the time needed to rotate from one 
field to the next. The transverse motion which leads to the 
end of a sequence and later to the beginning of a new one is 
due to the slight displacement of the spin axis on the pre- 
cession cone of ~ 4 degrees per day. Therefore the return of 
an epoch of observation is primarily determined by the pre- 
cession motion and the yearly solar motion on the ecliptic. 

We have seen in the previous section, that the aver- 
age length of a sequence in the astrometric fields only, is 
very close to two. With a total of ~ 80 observations in total 
(Fig. 4), this gives about 40 epochs distributed over the 1800 
days of the mission, or a typical interval between epochs of 
45 days. If we take the combined observations in the two 
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Figure 5. Frequency distribution of the intervals between con- 
secutive observations performed at the same epoch in the three 
photometric fields of view. This indicates that the most common 
interval between paired observations is the 6 hours of the scan pe- 
riod. The lower panel is a close-up view of the overall distribution 
shown in the upper panel. 



Table 2. Frequency distribution of the number of observations in 
continuous sequences. In the 2nd and 3rd columns only transits 
in the astrometric FOV are considered (that is to say in the BBP 
filters) while the last column includes also the MBP filters. The 
table gives the frequency of sequences of different lengths, accord- 
ing as the first field in the sequence is the preceding or following 
astrometric field of view (col 2 & 3), while for the MBP, due to 
the size and position of the detectors, all sequences start with few 
consecutive observations in MBP, followed by transits including 
only BBP and MBP, before additional crossings of the MBP. 



Sequence starts with 



length 


PFOV 


FFOV 


MBP 


1 


0.27 


0.63 


0.00 


2 


0.60 


0.20 


0.00 


3 


0.05 


0.11 


0.06 


4 


0.04 


0.02 


0.13 


5 


0.01 


0.02 


0.33 


6 


0.01 


0.00 


0.22 


7 


0.00 


0.00 


0.07 


> 7 


0.01 


0.01 


0.19 
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photometers instead, we have 280 observations and an aver- 
age sequence of 6.5 observations, that is to say a return again 
every 45 days. However we have seen that the total number 
of observations per star is also function of the ecliptic lat- 
itude, as any other effect tied to the precessional motion. 
The actual number as a function of the latitude can only be 
obtained through a simulation with a fairly large number of 
stars. 

The frequency distribution is shown in Fig. 6 for the 
ecliptic region, the intermediate latitude and the extended 
polar cap. These distributions are very informative, and in- 
deed more relevant for the purpose of variable analysis than 
a weakly scattered distribution about the average interval 
would be. Below b — 40°, there is quite an extensive cov- 
erage between 5 to 150 days, with two holes in the ranges 
50-80 days and 100-130 days. At higher latitude, the scat- 
ter is less pronounced, with a smaller period between two 
epochs, meaning more observations. Clearly the period re- 
covery of variable stars will depend, not strongly indeed, on 
the ecliptic latitude just because of the time sampling. The 
evaluation of this feature is the main object of the coming 
sections of this paper. A similar conclusion applies to the 
periods of spectroscopic binaries determined from the ra- 
dial velocity measurements as found by Pourbaix & Jancart 
(2003). 



4 SIMULATIONS 
4.1 The time sampling 

As said earlier a catalogue of 10 000 directions uniformly dis- 
tributed has been produced. From the position and size of 
the different fields of view and the scanning law it was pos- 
sible to sample the 1800 days of the mission and determine 
at any time which regions of the sky are observable by each 
instrument. When a star lies in a field, one computes then 
accurately the date of the crossing of the central line of the 
field, together with its ordinate and transverse velocity and 
the index of the field. Many refinements are used to speed 
up the runtime by eliminating quickly many of the stars not 
observable within a day. The output is finally re-organised 
to produce chronological time series for each of the 10 000 
stars. This gives at the end a set of about 3 million observa- 
tions scattered over 5 years. This file of realistic observing 
sequences is then the basis of the photometric simulation 
and the variability analysis. 

The distribution of the number of individual observa- 
tions (an observation at a particular date corresponds to 
the crossing of one field of view) is shown in Fig. 4 as a 
function of the ecliptic latitude for the 10 000 sources of the 
simulation. There is a strong dependence resulting from the 
scanning law, with mid-latitude region over observed com- 
pared to the mean, while the ecliptic region is observed less 
frequently than the average. The same pattern is common to 
the main astrometric field, which includes the BBP filters, 
and to the MBP field. However due to the larger across-scan 
extension of the latter, about twice as many observations 
are made in the latter field. This appears clearly in Fig. 7 
which displays the frequency distribution of the number of 




Figure 6. Frequency distribution (as a function of the ecliptic 
latitude) of the intervals of time between epochs of observations 
in the three photometric fields of view. The recurrence is primarily 
driven by a combination of the precessional motion of Gaia spin 
axis and the yearly solar motion. 



observations in the same two fields of view. The scatter in 
each field is significant with variation from 40 to 150 in the 
astrometric fields and 100 to 300 in the MBP field. 

Quite as important as the number of observations is the 
number of different epochs at which the signal is sampled. 
The observations are clustered into small groups of ~ 2 to 
10 observations within a day, before a new cluster shows it- 
self after several weeks. The number of epochs based on the 
same simulation plotted in Fig. 8 shows that there are essen- 
tially two regimes: the ecliptic zone with ±30° with about 30 
observational epochs over the five years and the region of up- 
per ecliptic latitude, above 40°, with ~ 50 difl^erent epochs. 
This feature impacts significantly on the variability analysis 
probably more than the total number of observations, since 
it is directly related to the ability to recover the characteris- 
tics of periodic phenomena. Even after a normalization over 
the S/N values, there should remain a systematic effect with 
the ecliptic latitude. 
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Figure 7. Histograms of the number of measurements for the 
astro-fields (both following and preceding) and for the MBP field. 
The extended scatter towards the largest values comes from the 
observations around 40° of ecliptic latitude. 
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Figure 8. Number of clustered groups as function of ecliptic 
latitude b. The pattern is much simpler than that for the total 
number of transits (Fig. 4), with only two regimes : low and high 
ecliptic latitude. 

4.2 The photometric signal 

For each star one generates the photometric variability, mod- 
elled by a simple sinusoidal signal with one frequency. More 
complex light curves have been tested with several harmon- 
ics representing non-sinusoidal oscillations of pulsating vari- 
ables or eclipsing binaries without affecting the main conclu- 
sions of this study, although the case of the eclipsing bina- 
ries is the source of specific problems. Therefore our baseline 
simulated signal has the form, 

s{t) = So + ylsin(27rz/t + (j)) + noise 



where A is the amplitude of the variability, z/ the fre- 
quency, (f> the phase. One should note that variable stars arc 
usually characterised by the amplitude of variability, that is 
to say by the peak to peak amplitude 2* A. The noise added 
to the deterministic signal is a Gaussian random variable 
of zero mean with a standard deviation determined by the 
signal-to-noiso ratio of each experiment. Although we have 
also experimented with multi-periodic signals, the results 
presented here refer to purely periodic variables. 

All the simulations are scaled by the S/N ratio (de- 
fined here as ^/ noise) . This is a convenient parameter which 
makes our results independent of the star magnitude. How- 
ever as a result of the systematic variation of the number 
of observations with the ecliptic latitude, the dominant ef- 
fect in the variability analysis beyond the S/N ratio, is the 
ecliptic latitude, lu order to avoid this bias, wo have used a 
normalised definition of the signal-to-noisc ratio, by dividing 
the single observation noise in S/N by n^''^. More precisely 
we have, 

X = S/Nx ^AV80 (2) 

so that X ~ 1 for an average observation with S/N = 1, 
while at 6 ~ 40°, x = 1 is equivalent to S/N ~ 0.65. There- 
fore any remaining effect with the ecliptic latitude will arise 

more from the time sampling than from the systematic dif- 
ference in the number of observations. 

Simulated variables are generated for a grid of frequen- 
cies and X- For each point of this grid, and for all the sources 
a period search is performed on the signal, without any a pri- 
ori information, using primarily an optimised version of the 
Lomb-Scargle algorithm (Press et al., 1992) or a frequency 
analysis based on orthogonal decomposition developed by 
one of us (FM). The rate of success in the period retrieval 
for every grid point is the main output of the program. Re- 
garding the size of the computation, we have sampled about 
10 periods (from the slowest Miras to the shortest pulsat- 
ing variable with periods of few hours) and between 3 and 
5 values of the normalised S/N. Each of the 10 000 stars 
(or 5000 for the runs limited to one hemisphere) has been 
used 5 times, with independent random noise on the obser- 
vations. This gives in total 50 000 period searches for every 
grid point. In total the number of computed "periodograms" 
reaches nearly 1.4 million. 



5 RESULTS 

5.1 Rate of detection 

Results consist primarily of correct rate of detection as a 
function of the period, the signal-to-noise ratio and the po- 
sition on the sky, this as a result of the inhomogeneous time 
sampling of Gaia. Fig. 9 gives for a single grid point {P = 0.2 
day and the unnormalised S/N = 0.75) the sky distribution 
of the correct rate of period retrieval. The main pattern 
reproduces the scatter of the number of observations with 
the ecliptic latitude, which has been subsequently eliminated 
with the normalised S/N. In the range of latitude around 
40° the rate is close to 80 percent, and between 20 and 
50 percent outside this area. The rate falls off very quickly 
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Figure 9. Rate of correct detections for S/N = 0.75 and a single 
period of 0.2 days, using only the astrometric fields. We are here 
in a range of small signal-to-noise ratio. For certain regions, the 
rate reaches nearly 0% and for other regions where the number of 
measurements is high, it reaches nearly 100%. The small effects 
in longitude follow from the non-uniform coverage observation 
density as of function of longitude over 5 years. 
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Figure 10. Rate of correct detection for normalised signal-to- 
noise ratios x (written on the right) and frequencies [cycle/day] 
(written on the left for the normalised signal-to-noise ratio of 0.9, 
the same colour / gray scale and line type are applied for the higher 
signal-to- noise ratios) . 

with lower signal-to-noise ratio, but on the other hand grows 
quickly to 100 percent for SjN > 1.5. 

Since there are only small secondary modulations with 
the ecliptic longitude, one can perform one dimensional anal- 
yses by looking at the dependence with the ecliptic latitude 
for various grid-points. Results are shown in Fig. 10. There 
are three sets of curves for x = 0.9, 1.1, 1.3 and different 
periods (8h, 1, 10, 100 and 330 days). 

• It is clear, that despite the normalisation of the S/N, 
the effect with the ecliptic latitude has not fully vanished, 
indicating an influence of the number of epochs (the number 
of clusters of observations separated by few weeks shown in 



Fig. 8). There is a systematic loss in the period recovery 
around b ~ 20°, not easily understood at the moment. 

• The rate of correct recovery grows quickly to 100 per- 
cent, with normalised S/N > 1.5. 

• Although the scatter is large, it seems that the shorter 
periods (< 1 day) are better recovered than the larger pe- 
riods. However the difference is not conspicuous and this 
comes as a surprising (and welcome) result, as it was thought 
that very short periods would be very difflcult to recover. 

5.2 Aliasing and false periods 

With irregular sampling, and within the range of periods 
analysed here, one should not expect too much aliasing (Eyer 
& Bartholdi, 1999), since the somewhat randomness of the 
sampling limits the building up of coherent signature at reg- 
ularly spaced frequencies. However, at low signal-to-noise 
levels, the recovery rate is low, and the search program may 
end up in a normal exit with no periodic signal found (the 
favourable outcome) or with the identification of a period 
(possibly wrong), without warning. It is then of interest to 
look at what kind of periods are found as a function of the 
true period. This is illustrated in Fig. 11 for the BBP (same 
field as the astrometric detector) (top) and the MBP fields 
(bottom). The simulated variable has a period of one day 
and S/N = 0.75 in this example. In the BBP, when the pe- 
riod is not recovered (about half of the cases tested) the final 
erroneous frequency can be almost anything, with a higher 
probability to frequencies of 11 and 13 cycles per day, com- 
ing from the convolution with the spectrum of the observing 
window. So, unless further assumptions, there is no way to 
decide from the output of the period search whether the 
period found is correct, although a significance level is eas- 
ily produced and could be used as a warning, but not as a 
measurement of the uncertainty of the period. In the case of 
the MBP, aliased results are clearly visible, regularly spaced 
with Itv = 2 cycles/day, corresponding to half of the satellite 
spin frequency (period of 6 hours). This comes from succes- 
sive observations regularly sampled every revolution within 
a cluster of observations, responsible for the aliasing and 
leading to a wrong determination of the period. This again 
cannot be avoided with the present design of the instrument, 
but fortunately limited to those observations with low SNR. 



6 CONCLUSION 

In this paper we have investigated the retrieval of periods of 
variable stars from observations to be carried out during the 
Gala mission. We have found, that with adequate frequency 
analysis algorithms, one can recover period of regular vari- 
able stars even with signal-to-noise ratio around unity. It is 
also shown that the very irregular time sampling expected 
from Gaia allows to determine frequencies much higher than 
the upper frequency estimated from a straightforward eval- 
uation of the Nyquist boundary (cf. Eyer & Bartholdi 1999), 
and with virtually no aliasing. These two features are very 
good news for the preparation, and subsequent scientific ex- 
ploitation, of the mission. 
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Figure 11. Example of terminal frequencies for an initial signal 
of S/N = 0.75 and frequency = 1.000. It reflects the pattern of 
individual periodogram, which bears the signature of the spectral 
window. 
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APPENDIX : DERIVATION OF EQUATION 1 

Let a star observed in one field and crossing the field in 

the along-scan direction at the ordinate y. The field height 
is h and < y < h during the transit. This crossing may 
happen at any ordinate with a uniform probability, so that 
P(Y < y) = y/h. After an interval of time At (one of the 
possible intervals between the preceding and following fields 
listed in Table 1) the rotation of the satellite has brought 
the line of sight of the second field in the same direction 
and the star may be observed again, provided its transverse 
motion during At has not been too large. Mathematically 
the star cannot be observed again if y + Vt At > h, meaning 
that its new ordinate is larger than the field height (we have 
restricted to the case Vj > to simplify the discussion, but 
this does not alter the final result for symmetry reasons). 
Therefore the probability that the star is not observed in 
the second field when the transverse velocity is Vt is. 



However, from this investigation we have noted that 
the frequency analysis without initial assumption on the fre- 
quency range may be very demanding in term of computer 
resources, hardly compatible with the goal of analysing from 
scratch ~ 10* sources. Therefore a major effort is needed 
during this preparatory phase to develop optimised software, 
to estimate quickly and efficiently possible periods or ranges 
of periods, before a thorough and accurate search can be 
launched on a well bounded frequency range. An estimate 
of the level of significance of each peak height should be also 
a built-in feature, alongside the possibiUty of searching for 
harmonics or a second independent period. 
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P((X = 1)|F,) 



if Vt At>h 



P{Y >h-Vt At) 



Vt At 
h 



if 



Vt At<h 



In the first case the transverse velocity is large enough for 
the transverse motion during At to be greater than the field 
width. In the second case there exists a region in the initial 
field of view where even after the time At the ordinate of 

the star is still less than the field width. The above proba- 
bilities are conditioned on the transverse velocity Vt and one 
must now introduce its random distribution. Assuming that 
the star can be anywhere on the scan circle with a uniform 
probability, one has Vt — Vrnax sin(0) where the maximum 
velocity Vrnax ~ 0.170 deg/h and <j) is uniformly distributed 
between and 27r. The final probability is just the average 
of the conditional probability over the distribution of the 
transverse velocity (restricted here to the positive values). 
The case when Vrnax At/h < 1 is very simple with 



P(X = 1) = 



2 V^max At 

TT h 



Jo 



sin (f>d<p = 



2 Vn 



At 



In the second case the integration extends up to (pm such 
that Vmax At sin <j)m = h, or (f)m = arcsin(/i/Vmax At), and 
beyond this angle the conditional probability is one. This 
yields for Vrnax At/h> 1, 

giving with u = fe/( Vrnax At) (hence w < 1 here). 



1 arcsin u 

7r 



which is the same as Eq. 1. 
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